Deyi Xiong

A Local Perturbation Theory for Cross-Domain Interference and Recovery in Multi-Domain RL

Jun 01, 2026
Via arXiv

Mix-MoE: Improving Multilingual Machine Translation of Large Language Models through Mixed MoEs

May 23, 2026
Via arXiv

DVMap: Fine-Grained Pluralistic Value Alignment via High-Consensus Demographic-Value Mapping

May 14, 2026
Via arXiv

From Insight to Action: A Novel Framework for Interpretability-Guided Data Selection in Large Language Models

Apr 28, 2026
Via arXiv

Why Does Reinforcement Learning Generalize? A Feature-Level Mechanistic Study of Post-Training in Large Language Models

Apr 27, 2026
Via arXiv

KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance

Apr 14, 2026
Via arXiv

DEP: A Decentralized Large Language Model Evaluation Protocol

Mar 01, 2026
Via arXiv

SOUP: Token-level Single-sample Mix-policy Reinforcement Learning for Large Language Models

Jan 29, 2026
Via arXiv

Finding the Translation Switch: Discovering and Exploiting the Task-Initiation Features in LLMs

Jan 16, 2026
Via arXiv

Revisiting Entropy in Reinforcement Learning for Large Reasoning Models

Nov 08, 2025
Via arXiv